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BACKGROUND OF THE INVENTION 

Field of Invention 

The present invention relates generally to the field of database systems. More 
specifically, the present invention is related to the reorganization of logically related LOB 
5 table spaces. 

Discussion of Prior Art 

A database consists of sets (or tables) of records (or rows) consisting of individual 
data fields (or columns). LOBs(large objects) are one class of data types that are stored in 
a database. There are three types of LOBs: (a) character large objects - CLOBs; binary 
10 large objects - BLOBs; and double byte character large objects - DBCLOBs. LOB table 
spaces consist of a base table space and one or more auxiliary tables which each have 
their own table space. In current database systems, the base table space and the auxiliary 
table space are reorganized independently. 

15 The following references provide a general teaching in the area of databases and 

reorganization, but they fail to provide for the limitations of the present invention's 
system and method. 

The U.S. patent to Blank et al. (5,842,208) provides for a high performance 
20 recover/build index system based upon unloading databases files in parallel. The 
recover/build index system builds a database index for a database file by scanning 
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partitions of the database file in parallel to retrieve key values and their associated record 
identifier (rid) values. The recover/build index system then sorts the scanned key/rid 
values for each partition in parallel. Next, the recover/build index system performs one 
or more merges on the sorted key/rid values from all of the partitions to generate a single 
5 key/rid value stream. Finally, the recover/build index system builds the index using the 
single key/rid value stream. 

The U.S. patent to Beavin et al. (6,038,569) provides for a system for data 
structure loading with concurrent image data. After data records are obtained from one or 
10 more data sources, each of the data records is associated with one of multiple pages. As 
each page is completed, it is written to a primary data structure. Partially or completely 
failed primary data structures may also be restored using image copies made previously. 

The U.S. patent to Bonner et al. (6,535,895) provides for a technique to avoid 
15 processing well clustered LOB's during reorganization of a LOB table space. A table 
space is reorganized in a database stored on a data storage device connected to a 
computer. When inserting or updating a LOB into a portion of the table space, a space 
map is marked to indicate whether the LOB was well inserted. When reorganizing the 
table space, when the space map indicates that a LOB was well inserted, reorganization of 
20 the portion of the table space in which the LOB was well inserted is avoided. U.S. patent 
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No. 6,606,617 and U.S. patent publication No. 2002/0065792, both by Bonner et al., 
provide for a similar teaching. 



The U.S. patent to Lewish et al. (6,237,003) provides for a method and apparatus 
5 for supporting dynamic run-time object definition in a relational database system. A 
mediating layer is introduce between the applications and the database objects. 

The U.S. application publication to Chen et al. (2002/0138497) discloses the 
implementation of a database trigger that automatically executes a set of SQL statements 

10 whenever a specified event occurs. The trigger could be reading or updating a database or 
perform other functions. The database is programmed to allow a large object (LOB) data 
to be stored with transition variables. A base table appears to comprise a database having 
a row identifier ("row'TD) and one or more non-large object columns and large object and 
large object (LOB) tables. RowK) and version are used to access the LOB data in an 

15 auxiliary table. 

The IBM TDB (Vol. 33, No.7, Dec 1990, pp 177-180) entitled "Browsing: A 
novel facility for exploring the contents of a datastore" discloses a method for navigating 
from a data element to related data elements, wherein the data elements may be related to 
20 other data elements of the same class (or another class) by named relationships. 
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Whatever the precise merits, features, and advantages of the above cited 
references, none of them achieves or fulfills the purposes of the present invention. 
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SUMMARY OF THE INVENTION 
The present invention provides for a method for reorganizing a table space in a 
database. An exemplary embodiment comprises the steps of: (a) blocking write access to 
data being reorganized; (b) identifying LOB table spaces that are related to the table space 
5 being reorganized; (c) concurrently creating a shadow data set for each of the LOB table 
spaces and a shadow data set for the table space and associated indexes; (d) loading rows 
into shadow data sets, and for each row loaded, reading LOBs from each of LOB table 
spaces relating to a loaded row and writing the read LOB to a corresponding shadow data 
set; (e) switching original data set with shadow data sets; and (f) allowing write 
10 operations related to data being organized to proceed. 

The present invention also provides for a system to reorganize a table space in a 
database. In an exemplary embodiment, the system comprises: (a) an identifier to 
identify LOB table spaces that are related to the table space being reorganized; (b) a 

15 shadow data set creator to concurrently create a shadow data set for each of the LOB table 
spaces and a shadow data set for the table space and associated indexes; (c) a shadow 
data set loader to load rows into shadow data sets, and for each row loaded, reading LOBs 
from each of the LOB table spaces relating to a loaded row and writing said read LOB to 
a corresponding shadow data set; and (d) a data switcher to switch the original data set 

20 with the shadow data sets. 
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Hence, the present invention's system and method reorganizes a table space (that 
contains a LOB base table) and related table spaces (which contain the LOB auxiliary 
tables). A major advantage of the present invention is that users of the REORG utility do 
not have to have knowledge of the related LOB table spaces. The LOB table space 
5 relationships are determined by the REORG utility. This allows for optimum 
performance because all of the table spaces, related to a LOB, would be reorganized at the 
same time. In addition, the LOB REORG would free unused space, as is done in a 
normal REORG, and the data would be loaded in the correct sequence. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates the method associated with DB2's current REORG utility. 
Figure 2 illustrates a method associated with an exemplary embodiment of the present 
invention. 

Figure 3 illustrates an exemplary embodiment of the present invention's system for 
15 reorganizing table space in a database. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

While this invention is illustrated and described in a preferred embodiment, the 

invention may be produced in many different configurations. There is depicted in the 

20 drawings, and will herein be described in detail, a preferred embodiment of the invention, 

with the understanding that the present disclosure is to be considered as an 

exemplification of the principles of the invention and the associated functional 
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specifications for its construction and is not intended to limit the invention to the 
embodiment illustrated. Those skilled in the art will envision many other possible 
variations within the scope of the present invention. 

5 The DB2 REORG utility reorganizes a table space to improve access performance 

and to reclaim fragmented space. Figure 1 illustrates the method associated with DB2's 
current REORG utility. The current DB2 REORG SHRLEVEL REFERENCE utility 
works by: (a) allocating "shadow" data sets for each table space and its indexes - step 
102; (b) blocking write access to the data - step 104; (c) unloading rows from the original 
10 tables space - step 106; (d) sorting the rows - step 108; (e) loading the rows into the 
shadow data sets and extracting index keys for each row as it is loaded - step 110; (f) 
sorting the index keys - step 112; (g) building the indexes from the sorted index keys - 
step 114; (h) switching the original data sets with the shadow data sets - step 116; and (i) 
allowing write operations to proceed - step 118. 

15 

The present invention provides for a system and method for the simultaneous 
reorganization of a base table and any related auxiliary table spaces. The present 
invention determines which auxiliary tables are related to the base table and automatically 
includes their respective table spaces in the same invocation of the utility. This relieves a 
20 user from knowing the relationships between the base and auxiliary table spaces. 
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In the present invention, REORG determines which LOB table spaces are related 
to the table space being reorganized. It creates shadow data sets for each LOB table space 
at the same time it creates the shadow data sets for the table space and indexes. During 
the loading of rows into the shadow table space, for each row loaded, LOBs are read from 
5 each of the LOB table spaces related to the row being loaded, and written into the shadow 
data set for that LOB. 

Figure 2 illustrates a method associated with an exemplary embodiment of the 
present invention. The method of figure 2 comprises the steps of: (a) allocating "shadow" 
10 data sets for each of the table spaces and indexes - step 202; (b) blocking write access to 
the data - step 204; (c) unloading rows from the original table spaces - step 206; (d) 
sorting the rows - step 208; (e) loading the rows into the shadow data sets and extracting 
index keys for each row as it is loaded - step 210, and 

(i) for each row, identifying which columns represent LOB data- step 212; 
15 and 

(ii) for each LOB column, utilizing the rowid of the current row to read the 
LOB from its associated LOB table space and write the read LOB to the 
corresponding shadow data set - step 214; 

(f) sorting the index keys- step 216; (g) building the indexes from the sorted index keys- 
20 step 218; (h) switching the original data sets with the shadow data sets (including LOB 
shadows)- step 220; and (i) allowing write operations to proceed - step 222. 
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Figure 3 illustrates an exemplary embodiment of the present invention's system 
for reorganizing table space in a database. The system comprises: (a) an identifier 302 to 
identify LOB table spaces that are related to the table space being reorganized; (b) a 

5 shadow data set creator 304 to concurrently create a shadow data set for each of the LOB 
table spaces and a shadow data set for the table space and associated indexes; (c) a 
shadow data set loader 306 for loading rows into shadow data sets, and for each row 
loaded, reading LOBs from each of the LOB table spaces relating to a loaded row and 
writing said read LOB to a corresponding shadow data set; and (d) a data switcher 308 for 

10 switching the original data set with the shadow data sets. 

The present invention includes a computer program code based product, which is 
a storage medium having program code stored therein, which can be used to instruct a 
computer to perform any of the methods associated with the present invention. The 
15 computer storage medium includes any of, but not limited to, the following: CD-ROM, 
DVD, magnetic tape, optical disc, hard drive, floppy disk, ferroelectric memory, flash 
memory, ferromagnetic memory, optical storage, charge coupled devices, magnetic or 
optical cards, smart cards, EEPROM, EPROM, RAM, ROM, DRAM, SRAM, SDRAM 
or any other appropriate static or dynamic memory, or data storage devices. 
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Implemented in computer program code based products are software modules 
having: (a) computer readable program code allocating shadow data sets for each of the 
table spaces and indexes; (b) computer readable program code blocking write access to 
the data; (c) unload rows from the original table spaces; (d) computer readable program 
5 code sorting the rows; (e) computer readable program code loading the rows into the 
shadow data sets, extracting index keys for each row as it is loaded: 

(iii) for each row, computer readable program code identifying which columns 
represent LOB data; and 

(iv) for each LOB column, computer readable program code utilizing the rowid 
10 of the current row to read the LOB from its associated LOB table space 

and write the read LOB to the corresponding shadow data set; 
(f) computer readable program code sorting the index keys; (g) computer readable 
program code building the indexes from the sorted index keys; (h) computer readable 
program code switching the original data sets with the shadow data sets, including LOB 
15 shadows; and (i) computer readable program code allowing write operations to proceed. 
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CONCLUSION 

A system and method has been shown in the above embodiments for the effective 
implementation of a system and method for concurrently reorganizing logically related 
LOB table spaces. While various preferred embodiments have been shown and 
5 described, it will be understood that there is no intent to limit the invention by such 
disclosure, but rather, it is intended to cover all modifications falling within the spirit and 
scope of the invention, as defined in the appended claims. For example, the present 
invention should not be limited by software/program, computing environment, or specific 
computing hardware. 

10 

The above enhancements for reorganizing table spaces in a database and its 
described functional elements are implemented in various computing environments. For 
example, the present invention may be implemented on a conventional IBM PC or 
equivalent, multi-nodal system (e.g., LAN) or networking system (e.g., Internet, WWW, 
15 wireless web). All programming and data related thereto are stored in computer memory, 
static or dynamic, and may be retrieved by the user in any of: conventional computer 
storage, display (i.e., CRT) and/or hardcopy (i.e., printed) formats. The programming of 
the present invention may be implemented by one of skill in the art of database 
programming. 

20 
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